Learning about giant pumpkin contest winners.
Rows: 28,065
Columns: 14
$ id <chr> "2013-F", "2013-F", "2013-F", "2013-F", "2…
$ place <chr> "1", "2", "3", "4", "5", "5", "7", "8", "9…
$ weight_lbs <chr> "154.50", "146.50", "145.00", "140.80", "1…
$ grower_name <chr> "Ellenbecker, Todd & Sequoia", "Razo, Stev…
$ city <chr> "Gleason", "New Middletown", "Glenson", "C…
$ state_prov <chr> "Wisconsin", "Ohio", "Wisconsin", "Wiscons…
$ country <chr> "United States", "United States", "United …
$ gpc_site <chr> "Nekoosa Giant Pumpkin Fest", "Ohio Valley…
$ seed_mother <chr> "209 Werner", "150.5 Snyder", "209 Werner"…
$ pollinator_father <chr> "Self", NA, "103 Mackinnon", "209 Werner '…
$ ott <chr> "184.0", "194.0", "177.0", "194.0", "0.0",…
$ est_weight <chr> "129.00", "151.00", "115.00", "151.00", "0…
$ pct_chart <chr> "20.0", "-3.0", "26.0", "-7.0", "0.0", "-1…
$ variety <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
| Name | pumpkins |
|---|---|
| Number of rows | 28065 |
| Number of columns | 14 |
| Column type frequency: | |
| character | 3 |
| factor | 6 |
| numeric | 5 |
| Group variables | None |
Variable type: character
| skim_variable | n_missing | complete_rate | min | max | empty | n_unique | whitespace |
|---|---|---|---|---|---|---|---|
| id | 0 | 1 | 6 | 6 | 0 | 54 | 0 |
| grower_name | 0 | 1 | 4 | 79 | 0 | 7982 | 0 |
| country | 0 | 1 | 5 | 79 | 0 | 75 | 0 |
Variable type: factor
| skim_variable | n_missing | complete_rate | ordered | n_unique | top_counts |
|---|---|---|---|---|---|
| city | 2779 | 0.90 | FALSE | 3218 | Ste: 292, Nap: 183, Por: 178, St.: 162 |
| state_prov | 0 | 1.00 | FALSE | 188 | Oth: 2242, Ont: 2021, Wis: 1910, Cal: 1211 |
| gpc_site | 0 | 1.00 | FALSE | 220 | Ohi: 759, Wie: 749, Ear: 722, Bau: 548 |
| seed_mother | 8537 | 0.70 | FALSE | 9996 | unk: 277, Unk: 260, 214: 122, 200: 104 |
| pollinator_father | 10302 | 0.63 | FALSE | 4538 | ope: 2658, Ope: 2065, sel: 2020, Sel: 1875 |
| variety | 27341 | 0.03 | FALSE | 86 | Big: 349, Dom: 150, Del: 37, Big: 33 |
Variable type: numeric
| skim_variable | n_missing | complete_rate | mean | sd | p0 | p25 | p50 | p75 | p100 | hist |
|---|---|---|---|---|---|---|---|---|---|---|
| place | 2381 | 0.92 | 520.66 | 499.83 | 1.0 | 119 | 265.0 | 909.00 | 1798.0 | ▇▂▂▂▁ |
| weight_lbs | 5267 | 0.81 | 303.52 | 295.23 | 0.1 | 70 | 169.5 | 526.38 | 999.8 | ▇▂▂▂▂ |
| ott | 3211 | 0.89 | 202.46 | 154.89 | 0.0 | 0 | 233.0 | 338.00 | 1132.0 | ▇▇▁▁▁ |
| est_weight | 8233 | 0.71 | 273.48 | 314.90 | 0.0 | 0 | 135.0 | 518.00 | 998.0 | ▇▂▂▂▂ |
| pct_chart | 3211 | 0.89 | 0.45 | 17.06 | -100.0 | -3 | 0.0 | 3.00 | 830.0 | ▇▁▁▁▁ |
Ugh. We need to do some data cleaning.
| state_prov | n | percent |
|---|---|---|
| 1498 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(91 exhibition only,\r\n 29 damaged) | 1 | 3.563157e-05 |
| 151 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(4 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 1569 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(154 exhibition only,\r\n 24 damaged) | 1 | 3.563157e-05 |
| 159 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(1 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 160 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(3 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 1681 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(108 exhibition only,\r\n 31 damaged) | 1 | 3.563157e-05 |
| 1742 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(104 exhibition only,\r\n 46 damaged) | 1 | 3.563157e-05 |
| 179 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(2 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 1798 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(99 exhibition only,\r\n 40 damaged) | 1 | 3.563157e-05 |
| 185 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(4 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 1883 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(111 exhibition only,\r\n 37 damaged) | 1 | 3.563157e-05 |
| 1900 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(128 exhibition only,\r\n 29 damaged) | 1 | 3.563157e-05 |
| 1905 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(103 exhibition only,\r\n 43 damaged) | 1 | 3.563157e-05 |
| 192 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(2 exhibition only,\r\n 3 damaged) | 1 | 3.563157e-05 |
| 194 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(10 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 197 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(7 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 1980 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(150 exhibition only,\r\n 32 damaged) | 1 | 3.563157e-05 |
| 200 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(7 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 203 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(11 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 206 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(3 exhibition only,\r\n 3 damaged) | 1 | 3.563157e-05 |
| 206 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(4 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 219 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(6 exhibition only,\r\n 12 damaged) | 1 | 3.563157e-05 |
| 219 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(7 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 226 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(14 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 227 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(4 exhibition only,\r\n 1 damaged) | 1 | 3.563157e-05 |
| 246 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(18 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 253 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(18 exhibition only,\r\n 5 damaged) | 1 | 3.563157e-05 |
| 254 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(9 exhibition only,\r\n 10 damaged) | 1 | 3.563157e-05 |
| 256 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(11 exhibition only,\r\n 1 damaged) | 1 | 3.563157e-05 |
| 271 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(22 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 272 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(15 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 273 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(22 exhibition only,\r\n 0 damaged) | 1 | 3.563157e-05 |
| 275 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(13 exhibition only,\r\n 0 damaged) | 1 | 3.563157e-05 |
| 278 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(19 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 283 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(27 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 289 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(26 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 290 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(15 exhibition only,\r\n 5 damaged) | 1 | 3.563157e-05 |
| 291 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(16 exhibition only,\r\n 4 damaged) | 1 | 3.563157e-05 |
| 292 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(9 exhibition only,\r\n 3 damaged) | 1 | 3.563157e-05 |
| 294 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(23 exhibition only,\r\n 3 damaged) | 1 | 3.563157e-05 |
| 300 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(14 exhibition only,\r\n 1 damaged) | 1 | 3.563157e-05 |
| 314 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(21 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 314 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(31 exhibition only,\r\n 3 damaged) | 1 | 3.563157e-05 |
| 316 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(24 exhibition only,\r\n 1 damaged) | 1 | 3.563157e-05 |
| 319 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(29 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 326 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(29 exhibition only,\r\n 3 damaged) | 1 | 3.563157e-05 |
| 328 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(29 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 330 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(24 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 334 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(27 exhibition only,\r\n 2 damaged) | 1 | 3.563157e-05 |
| 364 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(49 exhibition only,\r\n 6 damaged) | 1 | 3.563157e-05 |
| 366 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(38 exhibition only,\r\n 6 damaged) | 1 | 3.563157e-05 |
| 370 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(27 exhibition only,\r\n 5 damaged) | 1 | 3.563157e-05 |
| 383 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(47 exhibition only,\r\n 6 damaged) | 1 | 3.563157e-05 |
| 451 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t(73 exhibition only,\r\n 13 damaged) | 1 | 3.563157e-05 |
| Alabama | 75 | 2.672368e-03 |
| Alaska | 61 | 2.173526e-03 |
| Alberta | 300 | 1.068947e-02 |
| Antwerp | 232 | 8.266524e-03 |
| Aragon | 12 | 4.275788e-04 |
| Arizona | 12 | 4.275788e-04 |
| Arkansas | 131 | 4.667736e-03 |
| Baden-Wuerttemberg | 129 | 4.596472e-03 |
| Basilicata | 1 | 3.563157e-05 |
| Basque Country | 2 | 7.126314e-05 |
| Bavaria | 144 | 5.130946e-03 |
| Berlin | 1 | 3.563157e-05 |
| Bern | 12 | 4.275788e-04 |
| Brandenburg | 657 | 2.340994e-02 |
| British Columbia | 183 | 6.520577e-03 |
| Burgenland | 5 | 1.781578e-04 |
| California | 1211 | 4.314983e-02 |
| Campania | 36 | 1.282737e-03 |
| Carinthia | 23 | 8.195261e-04 |
| Catalonia | 46 | 1.639052e-03 |
| Central Finland | 51 | 1.817210e-03 |
| Colorado | 604 | 2.152147e-02 |
| Connecticut | 459 | 1.635489e-02 |
| Delaware | 5 | 1.781578e-04 |
| East Flanders | 31 | 1.104579e-03 |
| Emilia-Romagna | 142 | 5.059683e-03 |
| England | 231 | 8.230893e-03 |
| Flemish Brabant | 39 | 1.389631e-03 |
| Florida | 1 | 3.563157e-05 |
| Friesland | 6 | 2.137894e-04 |
| Galicia | 1 | 3.563157e-05 |
| Gelderland | 8 | 2.850526e-04 |
| Georgia | 29 | 1.033316e-03 |
| Graubuenden | 2 | 7.126314e-05 |
| Greater Poland | 8 | 2.850526e-04 |
| Hawaii | 3 | 1.068947e-04 |
| Hesse | 57 | 2.030999e-03 |
| Idaho | 42 | 1.496526e-03 |
| Illinois | 192 | 6.841261e-03 |
| Indiana | 674 | 2.401568e-02 |
| Iowa | 599 | 2.134331e-02 |
| Kansas | 78 | 2.779262e-03 |
| Kentucky | 345 | 1.229289e-02 |
| Kymenlaakso | 12 | 4.275788e-04 |
| La Rioja | 8 | 2.850526e-04 |
| Lapland | 1 | 3.563157e-05 |
| Lazio | 4 | 1.425263e-04 |
| Lesser Poland | 15 | 5.344735e-04 |
| Limburg | 44 | 1.567789e-03 |
| Lombardy | 166 | 5.914841e-03 |
| Louisiana | 25 | 8.907892e-04 |
| Lower Austria | 382 | 1.361126e-02 |
| Lower Saxony | 11 | 3.919473e-04 |
| Lower Silesian | 7 | 2.494210e-04 |
| Lubusz | 2 | 7.126314e-05 |
| Maine | 332 | 1.182968e-02 |
| Manitoba | 262 | 9.335471e-03 |
| Maryland | 7 | 2.494210e-04 |
| Masovian | 1 | 3.563157e-05 |
| Massachusetts | 415 | 1.478710e-02 |
| Mecklenburg-Vorpommern | 8 | 2.850526e-04 |
| Michigan | 1128 | 4.019241e-02 |
| Minnesota | 708 | 2.522715e-02 |
| Mississippi | 3 | 1.068947e-04 |
| Missouri | 165 | 5.879209e-03 |
| Montana | 35 | 1.247105e-03 |
| Navarre | 75 | 2.672368e-03 |
| Nebraska | 69 | 2.458578e-03 |
| Nevada | 7 | 2.494210e-04 |
| New Brunswick | 211 | 7.518261e-03 |
| New Hampshire | 219 | 7.803314e-03 |
| New Jersey | 22 | 7.838945e-04 |
| New Mexico | 3 | 1.068947e-04 |
| New York | 802 | 2.857652e-02 |
| North Brabant | 15 | 5.344735e-04 |
| North Carolina | 427 | 1.521468e-02 |
| North Dakota | 42 | 1.496526e-03 |
| North Holland | 16 | 5.701051e-04 |
| North Karelia | 34 | 1.211473e-03 |
| North Rhine-Westphalia | 302 | 1.076073e-02 |
| Northern Savonia | 2 | 7.126314e-05 |
| Nova Scotia | 1049 | 3.737752e-02 |
| Ohio | 1190 | 4.240157e-02 |
| Oklahoma | 61 | 2.173526e-03 |
| Ontario | 2021 | 7.201140e-02 |
| Opole | 5 | 1.781578e-04 |
| Oregon | 873 | 3.110636e-02 |
| Other | 2242 | 7.988598e-02 |
| Overijssel | 25 | 8.907892e-04 |
| Paijanne Tavastia | 9 | 3.206841e-04 |
| Pennsylvania | 992 | 3.534652e-02 |
| Piedmont | 69 | 2.458578e-03 |
| Pirkanmaa | 1 | 3.563157e-05 |
| Podlaskie | 4 | 1.425263e-04 |
| Pomeranian | 2 | 7.126314e-05 |
| Prince Edward Island | 24 | 8.551577e-04 |
| Quebec | 457 | 1.628363e-02 |
| Rhineland-Palatinate | 137 | 4.881525e-03 |
| Rhode Island | 308 | 1.097452e-02 |
| Saarland | 15 | 5.344735e-04 |
| Sardinia | 1 | 3.563157e-05 |
| Saskatchewan | 15 | 5.344735e-04 |
| Satakunta | 1 | 3.563157e-05 |
| Saxony | 173 | 6.164262e-03 |
| Saxony-Anhalt | 99 | 3.527525e-03 |
| Silesian | 21 | 7.482630e-04 |
| South Carolina | 9 | 3.206841e-04 |
| South Dakota | 201 | 7.161945e-03 |
| South Holland | 61 | 2.173526e-03 |
| Southern Savonia | 2 | 7.126314e-05 |
| Styria | 26 | 9.264208e-04 |
| Subcarpathian | 6 | 2.137894e-04 |
| Tavastia Proper | 6 | 2.137894e-04 |
| Tennessee | 251 | 8.943524e-03 |
| Texas | 10 | 3.563157e-04 |
| Thuringia | 175 | 6.235525e-03 |
| Tirol | 1 | 3.563157e-05 |
| Tuscany | 144 | 5.130946e-03 |
| Umbria | 16 | 5.701051e-04 |
| Upper Austria | 36 | 1.282737e-03 |
| Utah | 518 | 1.845715e-02 |
| Utrecht | 23 | 8.195261e-04 |
| Uusimaa | 53 | 1.888473e-03 |
| Valencian Community | 9 | 3.206841e-04 |
| Veneto | 11 | 3.919473e-04 |
| Vermont | 381 | 1.357563e-02 |
| Vienna | 56 | 1.995368e-03 |
| Virginia | 146 | 5.202209e-03 |
| Vorarlberg | 4 | 1.425263e-04 |
| Washington | 1118 | 3.983609e-02 |
| West Virginia | 89 | 3.171210e-03 |
| Wisconsin | 1910 | 6.805630e-02 |
| Wyoming | 85 | 3.028683e-03 |
| Zeeland | 3 | 1.068947e-04 |
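Let’s filter out the rows whose state_prov holds a scraped contest summary (strings such as “1498 Entries. (91 exhibition only, 29 damaged)”) instead of a real state or province. This should simplify our summaries.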
[1] Wisconsin California Ohio Michigan Washington
[6] Pennsylvania Oregon New York Minnesota Indiana
134 Levels: Alabama Alaska Alberta Antwerp Aragon Arizona ... Zeeland
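The filtering step can be sketched in a few lines. This is a minimal Python illustration, not the R code used for this post: the function names `is_summary_row` and `drop_summary_rows`, the regular expression, and the sample rows are all assumptions made for the example. The pattern is modelled on the summary strings in the state_prov table.

```python
import re

# Scraped per-contest summary strings look like
# "1498 Entries.<whitespace>(91 exhibition only,<whitespace>29 damaged)".
# DOTALL lets ".*" span the embedded carriage returns, newlines and tabs.
SUMMARY_ROW = re.compile(r"^\d+\s+Entries\..*damaged\)\s*$", re.DOTALL)


def is_summary_row(state_prov):
    """True when state_prov holds a contest summary instead of a place name."""
    return state_prov is not None and SUMMARY_ROW.match(state_prov) is not None


def drop_summary_rows(rows):
    """Keep only the rows whose state_prov is not a contest summary."""
    return [row for row in rows if not is_summary_row(row.get("state_prov"))]


# Hypothetical sample: two real entries and one summary row.
rows = [
    {"state_prov": "Wisconsin", "weight_lbs": "154.50"},
    {
        "state_prov": "1498 Entries.\r\n\t\t\t\t\t\t\r\n\t\t\t\t\t\t\t"
                      "(91 exhibition only,\r\n 29 damaged)",
        "weight_lbs": None,
    },
    {"state_prov": "Other", "weight_lbs": "146.50"},
]
cleaned = drop_summary_rows(rows)
```

Matching on the whole "N Entries. ... damaged)" shape, not just the word "damaged", keeps the filter from catching a legitimate place name by accident.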
Let’s visualize the top 10 states by median pumpkin weight in these contests. California has the highest median pumpkin weight of these states.
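For readers who want to reproduce the numbers behind a plot like this, here is a hedged Python sketch of the per-state median calculation (again, not the post's R code). The helper names `parse_weight` and `median_weight_by_state` and the sample rows are hypothetical. Stripping commas before conversion is an assumption about how the raw weight strings may be formatted, and "DMG" is a made-up non-numeric value.

```python
from collections import defaultdict
from statistics import median


def parse_weight(text):
    """Convert a weight string to float; return None if missing or unparseable."""
    if text is None:
        return None
    try:
        # Assumption: raw strings may carry thousands separators.
        return float(text.replace(",", ""))
    except ValueError:
        return None


def median_weight_by_state(rows, states):
    """Return (state, median weight) pairs for the given states, heaviest first."""
    weights = defaultdict(list)
    for row in rows:
        weight = parse_weight(row.get("weight_lbs"))
        if weight is not None and row.get("state_prov") in states:
            weights[row["state_prov"]].append(weight)
    medians = [(state, median(values)) for state, values in weights.items()]
    return sorted(medians, key=lambda pair: pair[1], reverse=True)


# Hypothetical rows; the weights are made up for illustration.
rows = [
    {"state_prov": "California", "weight_lbs": "1,200.00"},
    {"state_prov": "California", "weight_lbs": "800.00"},
    {"state_prov": "Ohio", "weight_lbs": "500.00"},
    {"state_prov": "Ohio", "weight_lbs": "300.00"},
    {"state_prov": "Ohio", "weight_lbs": "DMG"},
    {"state_prov": "Texas", "weight_lbs": "900.00"},
]
ranking = median_weight_by_state(rows, states={"California", "Ohio"})
```

The median is used here, as in the text, because a few very heavy entries would pull a mean upward.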
Let’s separate the pumpkins out by type, and compare the distributions over the years for the United States, the United Kingdom, and Canada. You can click on the legend to remove or show the different countries.
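The id column (for example "2013-F") appears to combine a year and a type code, so separating by type amounts to splitting that string. The Python sketch below shows one way to do it under that assumption. The names `split_id`, `tag_rows`, and `COUNTRIES` are hypothetical, and the second sample id, `"2013-P"`, is a made-up value used only to show a different code.

```python
# The three countries compared in the text.
COUNTRIES = {"United States", "United Kingdom", "Canada"}


def split_id(contest_id):
    """Split an id such as "2013-F" into (year, type_code)."""
    year, type_code = contest_id.split("-", 1)
    return int(year), type_code


def tag_rows(rows):
    """Keep rows from the compared countries and add year and type fields."""
    tagged = []
    for row in rows:
        if row["country"] not in COUNTRIES:
            continue
        year, type_code = split_id(row["id"])
        tagged.append({**row, "year": year, "type": type_code})
    return tagged


# Hypothetical rows for illustration.
rows = [
    {"id": "2013-F", "country": "United States"},
    {"id": "2013-P", "country": "Germany"},
    {"id": "2013-F", "country": "Canada"},
]
tagged = tag_rows(rows)
```

With year and type as separate fields, each type can be plotted on its own panel with one distribution per country and year.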
For attribution, please cite this work as
Laderas (2021, Oct. 22). Edward Hillenaar, MSc, candidate PhD: Pumpkins, Pumpkins, Pumpkins. Retrieved from https://laderast.github.io/articles/2021-10-19-great-pumpkins/
BibTeX citation
@misc{laderas2021pumpkins,
author = {Laderas, Ted},
title = {Edward Hillenaar, MSc, candidate PhD: Pumpkins, Pumpkins, Pumpkins},
url = {https://laderast.github.io/articles/2021-10-19-great-pumpkins/},
year = {2021}
}